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ESTIMATION  OF  SENSOR  DETECTION  PROBABILITIES  WITH 
DATA  FROM  CONCURRENT  SENSORS 

by 
D.  R.  Barr 

Abstract 

When  several  sensors  are  concurrently  scanning  the  same 
domain  for  signals,  varying  numbers  of  sensors  may  detect  each 
signal.   On  some  occasions,  a  signal  may  not  be  detected  by 
any  of  the  receivers.   Using  detection  data  collected  from  all 
the  receivers  over  a  period  of  scanning,  it  is  possible  to 
estimate  the  total  number  of  signals  that  occurred  in  that 
period  (including  those  that  were  not  detected  at  all) ,  as 
well  as  the  detection  probabilities  for  the  individual  receivers 
Several  estimators  for  these  quantities  are  developed,  in  the 
contexts  of  several  models  concerning  the  signal  generation 
process  and  the  receiver  behavior. 

Key  words:   detection  probability,  sensor  data,  intraclass 
correlation,  method  of  moments,  least  squares,  maximum 
likelihood. 


1.   Introduction 

The  problem  we  consider  concerns  estimation  of  per- 
formance characteristics  of  sensors,  based  on  historical  sensor 
reception  data.   We  shall  imagine  the  sensors  are  "receivers", 
passively  listening  for  "signals".   If  only  one  receiver  is 
turned  on,  an  operator  might  believe  his  receiver  is  detecting 
all  of  the  signals  that  occur  over  time;  he  would  have  no  data 
indicating  otherwise.   If  a  second  receiver  were  turned  on,  to 
operate  concurrently  with  the  first,  the  operator  might  notice 
that  sometimes  one  of  the  receivers  detects  a  signal  the  other 
receiver  does  not,  and  vice  versa.   This  implies  there  are 
signals  occurring  that  neither  receiver  detects,  and  the  de- 
tection probabilities  for  the  receivers  are  less  than  unity. 

Suppose   k   receivers  operate  concurrently  and  a  record 
is  kept  of  the  signals  detected  by  each  one.   It  is  assumed 
that  the  times  of  detection,  and  perhaps  the  nature  of  the  sig- 
nals detected,  makes  it  possible  to  determine  when  detections 
on  several  receivers  are  detections  of  a  common  signal.   It  is 
convenient  to  imagine  the  data  collected  from  the  detection 
history  with  k  receivers  are  in  the  form  of  k-dimensional 
vectors,  one  vector  for  each  signal  detected.   The  components 
of  such  a  data  vector  are  zeros  (for  "non-detect")  and  ones 
(for  "detect") ,  each  component  corresponding  to  a  particular 
receiver.   Thus,  for  example,  the  sum  of  components  of  a  data 
vector  for  a  given  signal  would  be  the  number  of  receivers  that 
detected  that  signal.   Note  this  sum  must  be  at  least   1,  since 


otherwise  one  would  have  no  knowledge  a  signal  occurred  and 
hence  there  would  be  no  related  data  vector.   Of  course,  signals 
may  occur  that  no  receiver  detects;  we  will  be  interested  in 
estimating  how  frequently  this  occurs . 

We  wish  to  use  the  set  of  data  vectors  observed  over 
some  period  of  time  to  estimate  detection  probabilities  of  the 
receivers,  and  perhaps  to  estimate  characteristics  of  the  sig- 
nal process,  such  as  the  distribution  of  signal  strength  and 
the  total  number  of  signals  occurring  in  the  period  in  question. 
Various  estimation  principles,  such  as  maximum  likelihood, 
method  of  moments  and  least  squares,  can  be  used  in  the  context 
of  various  models  for  the  signal  detection  process.   Thus  there 
are  a  number  of  candidate  models  and  estimators  which  could  be 
used  for  the  problem  we  have  described.   The  purpose  of  this 
report  is  to  develop  several  estimators  that  could  be  used  for 
this  problem;  a  later  report  will  compare  their  behavior  under 
varying  conditions  of  the  signal  process.   We  also  give  a  brief 
review  of  the  literature  related  to  this  problem  and  discuss 
associated  topics  such  as  intraclass  correlation  of  the 
receivers  and  possible  effects  of  dependence  among  the  receivers 

2 .   Related  Literature 

The  problem  we  consider  has  some  similarities  to  the 
capture-tag-recapture  methods  used  to  estimate  wildlife  popu- 
lations.  The  initial  capture  of  an  animal  is  analogous  to  the 
detection  of  a  signal  by  one  receiver;  subsequent  recapture  of 
that  animal  is  analogous  to  detection  of  the  same  signal  by  a 


second  receiver.   The  presence  of  a  tag  on  a  recaptured  animal 
makes  it  possible  to  determine  whether  the  two  "receivers"  have 
"detected"  a  common  "signal".   There  is  a  well  established  lit- 
erature on  this  and  related  methods  of  wildlife  population  es- 
timation, and  these  methods  might  be  useful  for  the  case  of 
k  =  2   receivers.   (See,  for  example,  Cormack  (1972)  for  a 
review  of  capture-recapture  methods  and  a  good  bibliography  on 
the  subject.) 

Applications  of  the  methods  we  discuss  can  be  made  in 
a  wide  range  of  situations.   It  could  be  used  to  estimate  the 
number  of  errors  in  computer  software  which  is  checked  by 
several  independent  testors  ("receivers").   The  models  have 
been  used  in  connection  with  visual  scanning  of  film  containing 
images  of  particles  occurring  in  a  bubble  chamber,  with  several 
film  scanners  acting  as  "receivers"  (Sanathanan  (1972) ) .   The 
application  to  receivers  detecting  signals  was  investigated  by 
Knorr  (1979) ;  one  of  the  methods  we  discuss  below  is  developed 
in  Knorr' s  report.   I  am  indebted  to  Professor  Knorr  for  sug- 
gesting this  problem  and  for  providing  suggestions  and  support 
during  the  course  of  my  work  on  the  problem. 

3.   The  Model 

Suppose  signal  strength  varies  randomly  from  signal  to 
signal,  so  that  receiver   i   detects  signal   j   with  prob- 
ability  7Ti.  ,  independent  of  other  signals.   Let  the  random 
variable  V.   denote  an  index  of  the  "strength"  or  "detect- 
ability"  of  the   jth   signal,  where   0  <  V.  <  1   with 


probability   1.   Assume  the  probability   tt  .  .   is  related  to 

signal  strength  v.   through  some  receiver  specific  function 

* 
t.  .   That  is,   tt  .  .  =  t.  (v.)  ,  where   v.   is  the  outcome  on 

V.  .   We  suppose  the  receivers  are  conditionally  independent, 

in  the  sense  that,  given  V  =  v  ,  the  joint  probability  of  a 

given  vector  of  detect-nondetect  components  is  the  product  of 

the  marginal  probabilities  for  each  individual  receiver. 

Symbolically, 


k 
(1)         P[Z  =  z  |V  =  v]  =   tt   P[ZX  =  z1  |V  =  v] 

i=l 

k        i  ,   i 

=    TT    T  .  (V)  Z    (1  "  T  .  (V)  ) 

i=l   x  x 


where   Z   is  a  random  k-vector  whose  components   Z    represent 
the  individual  detect/non-detect  events  that  will  occur  on  the   ith 
receiver  and   z   and   z    denote  points  in  the  sample  spaces 
of   Z   and   Z   ,  respectively.   Thus   Z    is  a  Bernoulli  random 
variable  and   Z   is  a  vector  of  conditionally  (given  V)  inde- 
pendent Bernoulli  components.   For  an  arbitrary  (future)  signal 

strength  V  ,  let   II-   denote   x  •  (V)   and   tt  .  =  t  .  (v)   (i.e., 

l  i  i     i 

drop  the  second  subscript  on   tt  .  .   when  it  carries  no  information)  . 

Let   S   denote  the  sample  space  of   Z  ,  so   S   contains 
2    k-dimensional  vectors  of  zeros  and  ones.   (We  include  the 
zero  vector,  0 ,  in   S  ,  even  though  it  cannot  be  observed  as 


* 
A  list  of  the  symbols  used  in  this  report  is  given  in  the 

Appendix. 


an  outcome  on   Z  ,  for  reasons  described  above.)   The  uncondi- 
tional probability  of  observing  an  outcome   zeS   is  obtained 
by  integrating  (1)  with  respect  to  the  marginal  distribution 
F   of   V  : 


(2) 


=  P[Z  =  z]  =  /  P[Z  =  z|v]  dF(v) 
0 


=  E 


"  k 

TT 

i=l 


Ti(V) 


(1  -  Ti(V)) 


1-Z' 


where   "E"   denotes  expectation  with  respect  to  V  .   By  the 
conditional  independence  assumption  (1) ,  the  unconditional 
marginal  probability  the   ith  receiver  detects  a  randomly 
selected  signal  is 


(3) 


=  pCz1  =  l]  =  /' 


t  .  (v)  dF(v] 


One  could  consider  several  forms  of  the  detection 
functions  x,  for  receiver  i;  i=l,2,...,k,  including  the 
following : 

(a)  (constant  signal  strength)   t. (v)  =  I.      for  all   i 
This  is  the  case  studied  by  Knorr. 

(b)  (equal  receivers)   t. (v)  h  v   for  all   i  . 

fl   if   v  >  i. 

—     l 

(c)  (receiver  threshold)   t . (v)  = 

' 0   if   v  <  I . 

i 


where  I.      is  the  threshold  limit  for  receiver   i  . 

(d)   (fuzzy  threshold)   x. (v)  =    i.     •  v  ,  where  I. 
denotes  the  proportion  of  signals  of  strength  v   that 
receiver   i   would  detect. 


By  taking  special  forms  of  the  distribution  of  V   and  the 
constants  I.     ,  it  can  be  seen  that  (a) ,  (b)  and  (c)  are  all 
special  cases  of  (d) .   Thus,  (a)  results  with  V  degenerate 
at   1   in  (d) ;  (b)  results  with  I.     =  1  in  (d) ;  (c)  results 
with  I .  =  1   and  V   distributed  Bernoulli  in  (d) .   Thus, 
we  shall  consider  only  the  case  (d)  with  various  conditions 
on  the   L's   and  distributions  for  V  . 

For  each  zeS  ,  let  n  denote  the  number  of  times  z 
occurs  in  the  data  set  (i.e.,  the  number  of  times  this  partic- 
ular set  of  detects  and  non-detects,  among  the   k   receivers, 

occurred  during  the  time  period  in  question) .   Let   n   be  the 

k  z 

2  -dimensional  vector  whose  components  are  the   n  's   in  some 

* 
specified  order  ,  and  let   N   be  the  random  vector  with  outcome 

n  ,  where   n   varies  over   N's  sample  space,  say   T.   It  follows 

that  N   is  distributed  multinomial  with  "number  of  trials" 

parameter   s   and  "type   z   outcome"  probability  vector   p 

whose  components   p   are  given  in  eq.  (2) .   Note   s   is  the 

total  number  of  signals  occurring,  including  the  undetected 

signals,  so   s   is  unobservable.   Symbolically,  we  summarize 

this  by  writing 


* 

A  convenient  method  of  indexing  the   z's   is  to  regard  the 

components  of   z   as  forming  a  binary  number,  then  indexing   z 

with  the  decimal  equivalent  of  that  binary  number.   Thus,  for 

example,  0=  (0,0,0,. ..,0)  is  indexed  by   0  ,  (0,0,. ..,0,1,0,1) 

is  indexed  by  5  (since  101,  .      =  5,   .   -,),  and  so  on. 
2  binary    decimal 


N  ~  M  .  (Sfp)   , 
2K 


where  the   2    components  of   p   are  given  by 


1  k 

(2')        p    =   /    7T 

0  i=l 


U. 


v)*  (1  -  I. 


v 


1-z 


dF(v)  , 


and  may  depend  on  the   L's   as  well  as  parameters  in   F  (v)  . 
We  use  this  multinomial  formulation  in  the  maximum  likelihood 
estimation  approaches  discussed  below. 


4 .   Intraclass  Correlation 

It  may  be  the  case  that  the   k   receivers  operate 
physically  independently  of  one  another  and  yet,  because  they 
all  are  subjected  to  the  same  signal  "stimulus"  at  a  given 
instant  of  signal  occurrence,  the  receiver  detection  outputs 
are  statistically  dependent.   This  is  analogous  to  intraclass 
correlation  in  experiments  having  random  effects  (see  Scheffe 
(1959) ) .   We  are  assuming  receivers   i   and   j   detect  a 
given  common  signal  having  strength  v  with  probability 
(£.  •  v)  (£  .  •  v)  .   Recall   Z1   and   Z-1   are  Bernoulli  random 
variables  whose  outcomes  (O's  or  l's  )  designate  the  respective 
receiver   i   and   j   detection  or  nondetection  of  the  given 
signal  in  question.   We  shall  show  the  correlation  between   Z 
and   ZJ   can  be  (and  generally  is)  positive,  so  the  receiver 
detections  are  not  in  general  independent. 


Now 


(4) 


CovCZ^Z3) 


E  (Z1Z11)  -  E{Z1)E(Z^) 


=  l.l  .    /  vdF(v)  -  1. 1. 
i  1   i  ii 


/  vdF(v) 
0 


i  j   V 


where   a„   is  the  variance  of  signal  strength,   V.   Similarly, 
the  variance  of   Z    is 


V(ZX)  =EV(V(Z1|V))  +  VV(E(Z1|V)) 


E(£.V(l-Jl.V)  )  +  V(£.V) 
l      l  l 


l.E(V)     -  £2E(V2)  +  I2    V(V) 
l         l  l 


i^a-i^)   , 


where   yv   is  the  mean  signal  strength,  with  a  similar  expres- 
sion for  the  variance  of   Z3  .   It  follows  that  the  correlation 
of   Z1   and   Z-1   is 


(5) 


PU^Z3)  = 


aT7/£  .  I  . 
V   J-  3 


yv/(l-£iyv)  (1-*^) 


=  >  0 


If  signal  strength  is  constant,  say  degenerate  at   1,  then 


a   =  0   and  this  intraclass  correlation  is  zero.   On  the  other 


hand,  if  signal  strength  varies  among  signals,  then   p  >  0 
and  the  receiver  outputs  cannot  be  independent.   It  is  useful 
to  note  from  (3)  and  (4)  the  probability  that  receivers   i   and 
j   both  detect  a  given  signal  is 


(6) 


E(ZX-ZJ)     =    £i£.E(V2)     =    i±i . (a^    +    yv) 


n       n  2  i      j  i      j 

=     I  ■  I  .  0TT     +     TT     IT J      =     TT     7TJ 

l    j    V 


2) 
1    +q 

y2 

V 


the  latter  quantity  being  related  to  the  signal  to  noise  ratio 
of  the  signal,   V  . 

As  a  specific  numeric  example,  suppose  V  has  a  beta 

a 
distribution  with  parameters   a   and   1  ,  so  that   F (v)  =  v 

2  2 

for   0  <  v  <  1   and   a  >  0  .   Then   a   =  a/(a+l)  (a+2)   and 

y   =  a./(a+l)  ,  so  using  (5)  ,  we  have  for  this  case 


pCZ^Z11)  =  /IX/(a+2)/(l  +  aU-i^))  (1  +  a(l-£.))  . 


If   a  =  1  ,  so  V   is  distributed  uniform  over  (0,1)  then 

0  <_  p  <_  1/3,  depending  on  the  values  of  I.      and  I.      between 


0   and   1 .   ( p  ~ 


.33  if  the  Z's      are   1;  p  Z    .1   if  the   £'s 


are  .5).   If   a  =  100  (say),  the  distribution  of   V   is  nearly 

2  —5 

degenerate  at   1   (y   =  .99   and  a      =  9.6  x  10   ),  and  the 

intraclass  correlation  is  nearly  zero  (0  <_  p  <  .01). 

In  a  model  that  assumes  (and  exploits)  independence  of 

the  receivers,  the  presence  of  positive  correlation  can  be 


10 


expected  to  degrade  the  estimates  of   tt   and   s   (the  vector  of 
receiver  detection  probabilities  and  number  of  signals  present, 
respectively) .   The  nature  of  the  degradation  depends  on  the 
particular  situation  and  may  be  difficult  to  determine.   It 
can  be  expected  that  the  variance  of  the  estimators,  and  the 
mean  square  error  of  the  estimators,  will  increase  if  a  model 
is  used  wnich  ignores  the  receiver  dependence  if  it  is  in  fact 
present.   It  should  be  possible  to  get  some  idea  of  the  strength 
of  correlation  among  receivers  by  examining  certain  residuals. 
For  example,  suppose   tt   and   s   are  estimated  by   tt   and   s  , 
using  estimators  which  are  derived  assuming  receiver  independ- 
ence,  and  suppose   tt   and   s   are  used  to  estimate  the  expected 
number  of  signals  that  should  have  been  detected  by  exactly   j 
receivers,  for   j  =  0,l,2,...,k.   Define  the   jth   residual  by 

(7)      R(j)  =  (observed  number  of  signals  detected  by   j   receivers) 
(expected  number  of  signals  detected  by   j   receivers) 

If  one  finds   R(j)   tends  to  be  positive  for   j's   near   0   and 
near   k  ,  while   R(j)   tends  to  be  negative  for  intermediate 
j's  ,  this  suggests   p  >  0  .   In  such  a  case,  the  quality  of 
tt   and   s   may  be  questionable,  and  it  might  be  prudent  to  use 
alternate  estimators  which  accommodate  dependence  among  the 
receivers . 


11 


5.   Estimation  Approaches 

We  shall  discuss  several  approaches  to  deriving  estima- 
tors for   it   and   s  .   Most  of  these  require  numerical  minimi- 
zation techniques,  which  can  be  carried  out  with  any  one  of 
several  commonly  available  programs,  such  as  GRG  (Lasdon,  et  al 
(1975)).   The  approaches  fall  into  three  categories:   Method 

of  Moments,  Least  Squares  and  Maximum  Likelihood.   In  all  cases, 

2 
av  =  0  is  assumed  in  the  estimation  model,  except  as  noted  in 

the  Maximum  Likelihood  approach. 

(a)   Method  of  Moments  Estimators.   Let   T   denote 

the  number  of  signals  detected  by  receiver   i  .   Then   T    is 

binomially  distributed  with  parameters   s   and   tt   ,  so 

E(T  )  =  s  •  tt   ;  i  =  1,2,.  ..,k.   Thus   k   equations  in  the 

12       k 
k  +  1   unknowns   s,  tt  ,  tt  ,  .  .  .  ,rr    are  obtained  by  setting 

t   =  stt   ;  i  =  l,2,...,k,   where   t    is  the  observed  outcome 

on  the  random  variable   T1  .   To  obtain  a   k  +  1   st  equation, 

let 


T*  =  I       (number  of  signals  detected  simultaneously  by 
i<j   receivers   i   and   j) 


so 


E(T*)   =  S  I         TT1!^ 


-> 
x     2 


=  S    \         TT   TT-1   , 


2  A  A  •  A  ■ 

assuming   ay  =  0  .   Set   t*  =  s  \    tt  ttj   and,  together  with  the 

\  AA  •  1<"J  ^  ^     . 

k   equations   t   =  stt  ;  i  =  1,2,..., k,  solve  for   s   and   tt 
to  obtain 
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s  = 


L1^ 


/t*  ;  tt1  =  tit*/  I       titj 


(b)   Least  Squares  Estimators.   First,  let  us  consider 
the  method  developed  by  Knorr  (1979) .   Let   A   denote  the 
event  [Z  ^  £]  that  a  given  signal  is  detected  by  at  least  one 

r  i 

receiver.   Recall   t   denotes   /,z    (sum  over  all  data  vectors 
z) ,  the  total  number  of  signals  detected  by  receiver   i  ;  let 
t   denote  the  number  of  such  data  vectors  (i.e.,  the  total 
number  of  signals  detected)  .   Now   tt   might  be  estimated  by 


(9) 


TT1    -    t 

TT    =  — 


P(A) 


if   P(A)   were  known,  since   t  /t    estimates  the  relative  fre- 
quency of  the  event  [receiver   i   detects]   among  those  occa- 
sions for  which  the  event   A  =  [some  receiver  detects]  occurred 

k 
Similarly,  let   Z   =  I    Z    denote  the  number  of  detections 

i=l 
made  of  a  given  signal,  let   I-'   denote  the  number  of  data 

vectors  for  which   z   =  j  ,   and  let   p  •   be  the  mass  function 

Li 

of   Z    (positive  at   z   =  0,l,...,k).   Now   Pz*(j)#  the 
probability  of  exactly   j   detections  of  a  given  signal,  might 
be  estimated  (for  j  >  0)  by: 


(10) 


P  .(j)  =  —  '  P(A) 
Z        t 
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if   P  (A)   were  known,  since  again,   iVt*   is  a  relative  fre- 
quency which  estimates  the  conditional  probability 
P[Z*  =  j|A]  .   For  the  case   j  =  0, 


(11)  p  . (0)  =  1  -  P(A)  . 

Z 


Now  consider   P (A)   to  be  a  parameter  which  can  vary 

over  the  interval  (0,1) .   For  each  value  of   P (A) ,  and  a  given 

i     *       i 
data  set  determining   t   ,  t   and  IJ  ,  one  can  calculate 

p  #(j)   using  (10)  and  (11).   Alternatively,  one  can  estimate 

Z* 
the  corresponding  probabilities  by  first  estimating  the   it 

using  (9) ,  then  calculating  the  probabilities  as  functions  of 

the   tt  's   using  probabilistic  arguments  (for  example,  see 

Knorr,  1979) .   Denote  the  estimates  determined  by  this  latter 

approach  by   p  .(j)  .   Finally,  form  the  squared  error  quantity 
Z* 

k 
(12)  E2(P(A))  =  I       (p  . (j)  -  p  .(j))2  , 

j  =  0    Z         Z 


and  find  the  "least  squares"  value   0  <  P(A)  <  1  minimizing 

(12) .   This  minimization  can  easily  be  carried  out  numerically 

Once   P(A)   is  obtained,  the   it1   are  given  by  (9)  with   P  (A) 
in  place  of   P(A).   The  total  number   s   of  signals  present 

can  then  be  estimated  by   s  =  t*/P(A) . 

A  variation  of  this,  which  might  be  described  as  an 

overconstrained  method  of  moments  with  least  squares  fit,  is 

as  follows.   As  in  the  method  of  moments,  set 
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t   =  sir   ;  i  =  1 ,  2  ,  .  .  .  ,  k 


This  gives   2k   equations  in  the   k  +  1   unknowns   s  , 


i, 


tt  ,...,tt   ,  since  the   p  .(j)'s   are  functions  of  the   tt 

Z* 
(and  we  take   p   (j)   to  be  such  functions  of  the   t^'s)  .   We 

Z* 
estimate   s   and   tt   with  a  weighted  least  squares,  weighting 

each  expression  by  the  inverse  of  its  variance.   Now 

V(T1)  =  sTT^-d-TT1)   and   V(I])  =  sp  #  (j)  (1-P  .  (j)  )  ,  so  we  wish 

Z*        Z* 
to  find   s   and   tt   minimizing 

]  "  sp  .(j) 


Z 


E2(S/7T1,...,7Tk)  =   I         (tl  '  S7t1)-2  +   I        - 

iil  STT^l  -  TT1)      j£l  »P  .(J)C1  -  P  .(D)) 


This  minimization  can  be  carried  out  numerically,  using  the 
Generalized  Reduced  Gradient  non-linear  maximization  algorithm, 
for  example. 

(c)   Maximum  Likelihood  Estimators.   Recall  the   2  - 

2 

dimensional  vector   N  ,  whose  "zth"  component   N    denotes  the 
number  of  times   zeS   occurs,  is  distributed  multinomial, 


N  ~  M  .  (s,p)  , 
2K 


where  in  turn   z   is  a  k-dimensional  vector  whose  components 
indicate  detection/non-detection  of  a  given  signal  with  the 
corresponding   k   receivers.   Similarly,   p   is  a  2  -dimensional 
vector   whose   "zth"   component,   p   ,  denotes  the  probability 
of  observing  such  a  detect/non-detect  pattern  among  the  receivers, 
p   =  P[Z  =  z];  zeS.   Note  this  formulation  imposes  no  condition 
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on  independence  of  the  receivers,  although  actual  computation 
of  the  p  's  in  terms  of  individual  receiver  detection  prob- 
abilities (the  tt  's)  would  involve  such  considerations. 

Following  the  development  by  Sanathanan  (1972) ,  the 
likelihood  function   L  ,  evaluated  at  the  observation 

"1      O  O     T  1 

(n  ,n  ,...,n     )  ,  a  (2  -1)  dimensional  vector  (where  we  omit 
n   ,  the  number  of  signals  that  went  undetected,  since   z  =  0 
is  unobservable) ,  is 


(13) 


L(s,p)  = 


V 

1   2      2-1 
n    ,  n  ,  .  .  .  n 


(P  ) 


„k  ,)n2  _1f  0^s_t 


IP 


2-1 


0 
where   s-t   =  n   .   It  is  convenient  to  factor  (13)  into  a 

marginal  likelihood  for   s   times  a  conditional  likelihood  for 

n   given   s  : 


(14) 


L(s,p)  = 


(P  ) 


s-t 


(1  -  p1 


1     2k-l 
n  .  .  .n 


(c  ) 


(      k 

2-1 
c 


n 


2k-l 


=  Lx(s,p  )  x  L2 (p)  , 

where   c   =  p  /(1-p  )  .   Sanathanan  considers  both  the  maximum 
likelihood  estimators   s   and   p  maximizing  (13)  and 
"conditional  maximum  likelihood  estimators"   s   and   p  ,  where 
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p   maximizes  L~  (p)   and   s   maximizes   L, (s,p  )  .   In  this 
situation,  it  is  known  that,  once   p   has  been  found,  the 
solution  for   s   is  given  by     (t  /(1-p  )  ,  where     (x) 
denotes  the  greatest  integer  not  exceeding   x  . 

In  our  model,  with   p    given  in  equation  (2) ,  the 
conditional  va.Z.e.      approach  will  involve  maximization  of  the 
second  factor  in  (14)  over  a  (k+m)  -  dimensional  space,  where 
m   is  the  number  of  unknown  parameters  in  the  distribution  of 

signal  strength,   V  .   We  shall  concentrate  on  two  special 

2 
cases:   one  in  which   av  =  0  ,  so   V   can  be  considered  to  be 

degenerate  at   1  ;  and  one  in  which  V   is  distributed  Beta 
with  parameters   a  >  0   and   3  >  0  ,  as  in  our  earlier  example. 
In  either  case,  the  maximization  can  be  carried  out  with  a 
numerical  minimization  routine  such  as  GRG . 

Case  1:   V  degenerate  at   1  .   The  conditional  likeli- 
hood function   L- (p)   is  a  function  of   k   parameters, 

£,,...,£.  ,  since  in  this  case  the  receivers  are  independent 

k    i      ,   i 
and,  by  (2),   pz  =   H  lz    (l-l.)L    z    for   z  ^  0  .   (For 

i=l   x     L 

n  k 

z  =  0  ,  p   =  1  -   J   pz=l-   n  (l-l.))     .   Once  the   n  's 

z^O  i=l     X 

(z  ^    0)  are  observed,   t*  =  \      n    is  known,  and   L~   in  (14) 

z^O 


so 


is  a  function  of   £,,...,£,  .   Note  in  this  case   7T   =  I.     , 

the  individual  receiver  detection  probabilities  are  estimated 

by  the  I.     ,  and   s  =   "(t*/  tt   (l-£.))  .   The  numerical 

1  i=l     x 

maximization  of   L?   in  this  case  is  over  the  k-dimensional 


space  of  t . ' s  . 
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gives 


Case  2:   V  ~  Beta (a, 3)  .   In  this  case,  equation  (2') 


(15) 


1 


B(a,3) 


n 

n 

i=l 


oe.v)z  ci-^.v)1  z 
i  i 


a-l  ,,   , 
v     d-v] 


i-1 


dv 


Now  for  each  trial  set  of  l.'s      and  a    ,  3   in  the  numerical  max: 
mization  of   L~   in  (14) ,  the   p  's   can  be  obtained  from  (15) 
by  numerical  integration.   Such  numerical  integration  thus 
becomes  part  of  the  process  of  evaluating  the  objective  function 
at  each  stage  of  the  numerical  maximization;  this  maximization 
would  be  over  the  (k+2)  -  dimensional  space  of  the  t.  's   and 
a,  3  •   As  a  special  case,  if   3=1   and  the  receivers  are 
identical  (the  l.'s      are  the  same) ,  the   p  's   in  equation  (15) 
can  be  obtained  explicitly  as  an  incomplete  beta  function: 


f1 
pz  =    Uv)z  il-Zv) 

0 


k-z     a-l  , 
av     dv 


i 


[         (z'+a)-l  ,,  „    . (k-z"+l)-l 
u         (l-lv)  dv 


0 


a 


B(£;  z  +a,  k-z'+l)  , 


where   B(x;  a,b)  is  the  incomplete  beta  function  with  parameters 
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a   and   b  .   This  function  is  available  as  a  subroutine  in  many 
computer  installations,  so  a  separate  numerical  integration 
routine  need  not  be  called  in  this  special  case.   The  numerical 
maximization  of   L~   is  over  the  2-dimensional  space  {I, a.)       in 
this  case. 

6 .   Testing  the  Model 

A  chi-square  test  of  a  model  for  the  signal  detection 
process  can  be  made  in  cases  where  the  maximum  likelihood 
approach  is  used.   This  can  be  applied,  somewhat  more  approxi- 
mately, when  conditional  maximum  likelihood  approach  is  used. 
This  test  can  be  used,  for  example,  to  investigate  whether  the 
receiver  detection  probability  model  (t . (V)  =  Z.*V     with  V  ~  F 
and  "conditional  independence"  of  the  receivers)  is  plausible, 
in  light  of  the  observed  data.   In  particular,  with  V  degenerate 
at   1  ,  it  can  be  used  to  test  the  plausibility  of  the  assumption 
of  receiver  independence. 

The  test  is  conducted  as  follows.   Once  the   p   and   s 

have  been  estimated,  the  expected  number  of  outcomes  of  each 

z        ~z 
type   z   for   zeS   can  be  estimated  by   E (N  )  =  s  •  p   ,  and  the 

test  statistic 


t  =  I       (n   -  E(NZ))VE(NZ) 
z^O 


computed.   Under  the  hypothesized  model,   t   is  (approximately) 
an  outcome  from  a  chi-square  population  with  (2  -1)  -  (m+1) 
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degrees  of  freedom,  where  m   is  the  number  of  parameters  esti- 
mated (i.e.,  the  dimension  of  the  space  over  which  the  numerical 
maximization  was  conducted) .   As  usual  for  chi-square  "goodness- 
of-fit  tests,  if  some  of  the  E (N  ) 's   are  less  than  three  or  so, 
these  "categories"  should  be  pooled  with  others  so  the  three  or 
more  expected  count  rule  of  thumb  is  met,  with  corresponding  re- 
duction in  the  degrees  of  freedom  for  the  test  statistic.   The 
level  of  significance  of  the  test  is  the  tail  area  above   t  , 
under  the  appropriate  chi-square  density. 

7 .   Comparison  of  the  Estimators  by  Simulation 

CAPT  Dave  Hendrickxis  currently  conducting  a  series  of 
simulations  to  evaluate  and  compare  the  behavior  of  the  esti- 
mators described  above,  for  a  variety  of  data  input  streams 
related  to  the  general  model   t. (V)  =  £.*V  ,  with 
V  ~  Beta (a, 1)  .   By  varying  a      from   1   to   100  ,  varying 
degrees  of  intraclass  correlation  can  be  generated  among  the 
simulated  receivers.   As  mentioned  above,  with  a  ~  100  ,  the 
receivers  are  essentially  independent,  since  V   is  then  nearly 
degenerate  at   1  .   On  the  other  hand,  with   a  =  1   the  signal 
strengths  vary  at  random  over  (0,1),  and  there  is  appreciable 
intraclass  correlation  among  the  receivers.   Details  of  the 
simulation  and  analyses  of  the  results  will  be  published  in 
Hendrickx's  masters  thesis,  Hendrickx  (1981). 


20 


Preliminary  analyses  of  results  from  a  limited  number 
of  simulation  cases  suggest  the  following  conjectures  concerning 
the  three  estimation  procedures : 

1)  it  is  not  the  case  that  one  of  the  procedures 
dominates  the  others  over  all  situations; 

2)  generally  speaking,  the  maximum  likelihood  esti- 
mator has  smaller  variance  than  does  the  method  of  moments  or 
least  squares; 

3)  all  procedures  appear  to  underestimate   s   and  the 
ir.  '  s   when  the  receivers  are  made  dependent  by  taking   a   as 
small  as   1  ;  and 

4)  the  estimators  perform  generally  surprisingly  well, 
even  when  the  total  number  of  signals   s   is  as  small  as  25  or 
so,  without  regard  to  the  number   k   of  receivers  or  the  mix 

of   it  .  '  s  ,  when  the  receivers  are  (at  least  nearly)  independent. 

As  remarked  above,  these  are  preliminary  observations;  more 
details  will  be  documented  by  Hendrickx. 
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Appendix 

LIST  OF  SYMBOLS 
(in  order  of  occurrence) 

k:   number  of  receivers 

s:   number  of  signals 

tt  .  .  :   probability  receiver   i   detects  signal   j 

V.:   strength  of  signal  j  ;  V.  ~  F(v)  ,  E  (V)  =  y   ,  V  (V)  =  aZ.    . 

T.(V.)   =  TT  .  .  -     t  .       *   V. 

i  ]    13    1   : 

1      k 
Z  =  (Z  ,...,Z  )  :   vector  of  Bernoulli  detect/non-detect 

indicators,  for  a  given  signal,  for  all 

receivers . 

tt1  =  PEz1  =  1]  =  E(xi(V))  =  l±   •  yy  . 

S:   sample  space  of   Z  . 

p   =  P[Z=z];  zeS. 

n  :   number  of  times   zeS   occurs 

,01       2-l>  jt    z ,  .  - .  ,    , 

n  =  (n  ,n  , . . . ,n     ):   vector  of   n  's   in  some  specified  order; 

z ,        0 
n     rt  =  n 
1  z  =  0 

p  =  (p  ,p  ,...,p     ):   vector  of   p  's   in  same  order  as  for  n 

N:   random  vector  with  outcome   n  ;  N  ~  M  u(s,p) 

2K 

R(j):   the   jth   residual;  R(j)  =  (number  of  signals  detected 

by   j   receivers)  -  (expected  number 
of  signals  detected  by   j   receivers) . 

T  :   number  of  signals  detected  by  receiver   i   (outcome  is   t   ) 

T*:    I       (number  of  signals  detected  simultaneously  by  receivers 
i<j   i   and   j  ) 

A:   the  event  [Z  ^  0] 

s ,  tt  :   estimates  of   s   and   tt   ,  respectively 

t  :   number  of  signals  detected 

•     r  i  • 

Z   =  2.Z  :   number   of  detections  of  a  given  signal 

i 
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p   (j) :   mass  function  of   Z 
Z* 


3  . 


number  of  data  vectors   z   for  which   z 


L(s,p) :   likelihood  of   N   at   n 

i     i  ,  ,,   0, 
c   =  p  /il-p  ) 
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